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Abstract 

Our article considers a regression model with observed factors. The observed factors 
have a flexible stochastic volatility structure that has separate dynamics for the volatilities 
and the correlation matrix. The correlation matrix of the factors is time-varying and its 
evolution is described by an inverse Wishart process. The model specifies the evolution of 
the observed volatilities flexibly and is particularly attractive when the dimension of the 
observations is high. A Markov chain Monte Carlo algorithm is developed to estimate the 
model. It is straightforward to use this algorithm to obtain the predictive distributions of 
future observations and to carry out model selection. The model is illustrated and compared 
to other Wishart-type factor multivariate stochastic volatility models using various empirical 
data including monthly stock returns and portfolio weighted returns. The evidence suggests 
that our model has better predictive performance. The paper also allows the idiosyncratic 
errors to follow individual stochastic volatility processes in order to deal with more volatile 
data such as daily or weekly stock returns. 
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i 1 Introduction 



2 For the last two decades, multivariate stochastic volatility (MSV) models have been an im- 

3 portant class of models in financial econometrics. Recent developments in this area focus on 

4 dimension reduction since the complexity of computation and t he difficulty in mode l interpre- 



5 tation grow drastically as the dimension of the model increases. 



Harvey et al 



6 first to discuss a factor structure for MSV models. The seminal work by 



1994) we re the 



Jacquier et al 



(jl995l ) 



introduced Bayesian approa ches to the factor MSV (FMSV) literature. Th e FMSV model is also 



consid ered and discussed by 



Pitt and Shephard 



(|l999h . 



Chib et al 



(|2006h . 



Lopes and Carvalho 



9 (|2007l ). among others. A common feature in these FMSV models is that they impose the diag- 

10 onality assumption on the factor correlation or covariance matrices, implying that the factors 

11 are uncorrelated. However, it is often unrealistic to assume that the factors do not interact with 

12 each other, especially when the factors are observed. 

13 To relax the diagonality assumption, iPhilipov and Glickmanl (|2006bl ) introduces a time- 



14 varying FMSV model in which the inverse fact or covariance matrices are driv en by Wishart 

15 processes. The model is a direct application of 



Philipov and Glickmanl (|2006al ) to the factor 



16 structure. The inverse Wishart specification introduced by Philipov and Glickman (2006a, b) has 

17 the attractive property that it can be easily incorporated into model estima tion with Bayesian 
is Marko v chain Monte Carlo (MCMC) methods. Based on a similar setting, 



Asai and McAleer 



19 (j2009j) also proposes an MSV model, where the individual return series is modeled with the 



20 stochastic volatility ( SV) process and the coy ariance process is characterized by the inverse 



2i Wishart distribution. 



Asai and McAleerl (|2009l ) call this type of model a "Wishart Inverse Co- 



22 variance" (WIC) model. 

23 When the vector of d e penden t va riables is high-dim e nsion al, the WIC models proposed 

Jul 



24 by 



Philipov and Glickmanl (|2006al ) and 



Asai and McAleerl (|2009l ) have two problems. First, the 



25 computation becomes highly time-consuming as the dimension increases. Second, the time effect 

26 among the different series is controlled by just one scalar persistence parameter, which is likely to 



2 



be too restrictive in real applications. The factor structure proposed in lPhilipov and Glickman 



(|2006bl ) helps resolve the first problem. However, when it comes to more factors, say, three 
or more, these distinct underlying factors still have to share a common time effect controlled 
by a single persistence parameter; hence, the second problem remains. In order to solve the 
two problems simultaneously, we propose in our article an observed dynamic-correlation FMSV 
model (O-DCFMSV). 



The basic model form of O-DCFMSV is similar to that of 



Asai and McAleeri (120091) . but 



the st ructure is applied to a factor model. Consequently, compared to 



Asai and McAleer 



(|2009l ). the O-DCFMSV model has advantages in both model estimation and in interpreta- 
tion for high-dimensional data. Moreover, since in the O-DCFMSV model we allow differ- 
ent time effects on the fact o rs thro ugh separate SV processes, it is more flexible compared to 



the 



Philipov and Glickman! (|2006bl ). To estimate the model, we develop an MCMC algorithm 



13 that deals with all unknown parameters and latent variable s joint ly, which is quite different 



from the partial MCMC approach used in 



Asai and McAleeri (|2009l ). Using our approach, pre- 



15 diction and model sele c tion be co me straightforward, issues that are not dealt with in either 



Philipov and Glickmanl (|2006bl ) or 



Asai and McAleeri (|2009l ). We illustrate how to implement 



the one-step-ahead prediction, by which we can forecast many quantities of interest, such as 
the return series, the return covariance matrix, the correlation matrix of the factors, and the 
value at risk (VaR) of a portfolio. We can also conduct model selection based on predictive 
performance. 

To summarize, the contribution of our paper is twofold. First, it introduces a flexible factor 
model to the MSV literature. Secondly, the MCMC algorithm designed in this paper can be 
used for prediction and model selection, which significantly extends the usefulness of the WIC 
models in real problems. 

The remainder of the paper is organized as follows. Section [2] presents the model and the 
MCMC algorithm to estimate the model. Section [3] conducts a simulation study to illustrate 
the model. Section [J] provides two empirical examples. The O-DCFMSV model is applied to 



1 portfolio and stock return data and is compared to the model of lPhilipov and Glickmanl ((2006b) 

2 based on the quality of one-step- ahead predictions. Section extends the model to the case 

3 where the idiosyncratic error terms are allowed to follow independent SV processes. Section [U] 

4 concludes the paper. 

5 2 The Model 

e 2.1 Model Specification 

7 Suppose that at time t we have p asset returns, yt, and q underlying observed factors, ft, such 
s that 

y t = Bf t + e t , (1) 

where the {ft,t > 1} and {et,t > 1} are independent stochastic processes. The et are also 
assumed to be independent with e t ~ N p (et\0, 0), Q = diag(<7 2 , a'l), where N p (X\ti,'£) is 
a p-dimensional multivariate normal density in X with mean fi and covariance matrix XI. The 
assumption that the conditional variance of et is constant is relaxed in Section [5] to allow et to 
have SV dynamics. The model for the factors is as follows: 

ft = V t 1/2 e t , (2a) 

Vf /2 = diag (> tl/2 , e ht2/2 , e ht « /2 ) , q < p, (2b) 

h t+ i = [i + 4> o (ht - /it) + Vt, (2c) 
hu ~ Nyhii m, YZ^j^ = 1 ' 2 ' ( 2d ) 

where N(x|/i, a 2 ) is a univariate normal distribution in x with mean ^ and a 2 , and o is the 
elementwise multiplication operator. The stochastic sequences {e t ,t > 1} and {rjt,t > 1} are 



4 



independent with r\ t also an independent sequence and 



e t \P t ~N g (e t |0,E e , t ), (3a) 
f)t ~ Ng^tlOjE^) , = diag^!,...,^) . (3b) 

The covariance matrix is a correlation matrix which is obtained by standardizing the 
q x q stochastic covariance matrix P t so that 

S e , t = (dia g Pt)-^P t (diagP f )-i (4) 



The dynamics of p, and hence £ e> t are given by the stationary autoregressive inverse Wishart 
process 



P^\k,P t ~ l ^^N q {P^\k,S t ), S t 



1 _ d _d 
_ p 2 4 p 2 

AT* ; 



(5) 



1 where W q (X\k, S) is a g x q Wishart density in X with degrees of freedom (df) k > q and 

2 the scale matrix S. The q x g matrix A is a symmetric positive definite matrix parameter, 

3 and d is a scalar parameter that accounts for the memory of the matrix process {Pt}- The 

4 matrix power operation P ^ 2 is defined by a spec tral decomposition. Similarly to Philipov and 



5 Glickman (2006a, b) and 

6 convenience. 



Asai and McAleerl (|2009l ). we set the initial value Pq to be Pq = I q for 



In the WIC context, there are two different ways to define the scale matrix. 



Asai and McAleer 



s (|2009l ) uses the specification ([5]), while Philipov and Glickman (2006b) uses a BEKK-type rep- 
9 resentation 

-l\d 



1 



(6) 



io where A 2 is defined by a Cho lesk y decomposition such th at A = A^{A^)'. In either case 



Philipov and Glickman! (|2006al ) and 



Asai and McAleerl (|2009l ) show that log |P<+i| is a first-order 



5 



1 autoregression with autoregressive parameter d; if d £ (— 1, 1), then this first order autoregressive 

2 process is stationary. We have also conducted simulations that suggest that the whole process 

3 Pt is stationary for d G ( — 1, 1). 



Asai and McAleerl (12009') in 



Asai and McAleer 



4 Although the O-DCFMSV model has some similarities with 

5 model specification, the two models are in fact different in several respects. First, 

6 (|2009 ) adopt the settings (|2ap - (0) to model the return series, while in O-DCFMSV, we ap- 

7 ply the settings to the observed factors. As a matter of fact, this is the main advantage of 

8 O -DCFMSV. When i t com es to a high-dimensional environment, the estimation of the model 

9 of 



Asai and McAleerl (|2009 ) is extremely tedious. The reason is that the model itself is defined 

10 through a sequence of spectral decompositions or singular value decompositions, and the pos- 

11 terior densities of the parameters are complicated and dependent on the data dimension. The 

12 second difference is in the sampling scheme to estimate the model which we discuss below. 



is 2.2 Priors 



There ar e two set of parameters i n the measurement equation ([I]). For CI = diag (a\ , a^) . 



following 



Liesenfeld and Richard! (|2006l ). we assign independent inverse gamma priors for the 



idiosyncratic variances cr|. Specifically, <r| ~ IG((j| \shape = vq/2, scale = vqSq/2), j = 1, —,p. 
In all our analyses, we use vq = 10 and sq = 0.01. This defines a vague prior which i s commonly 



adopted in the literature. For the loading matrix B, following 
the prior given by 



Jacquier et al 



(|1995l ). we choose 



p(B\ft) oc |r2|~ p / 2 etr (~n~ 1 BB''\ 



(7) 



14 where etr(X) means exp(trace(X)). This prior implies that the columns Bi of B are a priori 

15 independent, each with a prior N p (Sj|0, CI), which is uninformative relative to the d ata. 



T 



re priors for the SV parameters are as follows. We adopt the default settings by 



Kim et al 



17 (|1998l ). For the mean \i\ and variance oz^ i = l,..,q, we respectively assume that ^ 
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1 N (/Xj|0, 10) and a 2 i ~ IG(d^ ^5, 0.05). The prior for 0j is a shifted and scaled beta distribution. 

2 Let </>j = 2<pi — 1 where <^>j ~ Beta^ 1 ), 0( 2 )). We choose i^ 1 ) = 20 and <f>^ = 1.5, implying a 

3 prior mean of 2 < />« /(0W + 0( 2 )) - 1 = 0.86. 

4 The priors for the correlation-level parameters are chosen as follow: for A we specify the 

5 prior A -1 ~ W q (A~ 1 \q, q~ 1 I q ), which implies a prior mean of I q ; for d, we choose the vague 
e prior d ~ Unif(d| — 1,1). Finally, for k we set k ~ Aoe _Ao / ( - (?j00 )(/c). Note that the prior for 
7 A; is a truncated exponential distribution with a rate parameter Ao- Throughout the paper we 
s set Ao = 0.02. This implies a prior mean of 50 + q and a prior standard deviation of 50, which 

9 specifies a diffuse prior. 

10 2.3 MCMC Estimation 

11 2.3.1 Joint Distribution 

12 We estimate the model using the MCMC simulation method described below. Let the observed 

13 data Y = {y t } : T x p, F = {f t } : T x q, the log volatilities H = {h t } : T x q, the normalized 

14 factors e = {e t } : T x p, and the sequence of unnormalized covariance matrices P = {Pt,t = 

15 1, . . . , T}. Let u = {uji,i = 1, ...q}, with u-i = {/Xj, (pi, cr v ,i,i = 1, q}, be the parameters of 

16 the volatilities of the factors. 

The joint density of (Y, F, H, e, P, B, ft, uj, A, d, k) is 

p(Y, F, H, e, P, B, n, u, A, d, k) = p(Y\B, F, Sl)p(F\H, e)p(H\u)p(e\P) (8) 

x p(P\A, d, k)p(B\n)p(n)p(A)p(d)p(k), 
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where 



p(Y\B,F,n) = Y[p(Vt\ft,n), (9a) 
t=i 

T 

p(F\H,e)=Hp(f t \h t ,e t ), (9b) 
t=i 

T 

p(h t \w) =p(h\\w) Y[p(ht\h t -i,u;), 

t=2 

Q <? 

p(hi\u) = \\p(hu\ui), p(ht\ht-i,u>) = Y[p(hu\h t -i,i,oji), (9c) 

i=l i=l 
T 

p(e\P,A) = Y[p(e t \P t ), (9d) 

T 

p(P|A, d, fc) = JJp(P t |P t _i, A, d, k). (9e) 
t=l 

1 The densities p(y t \ft,Q) in (|9a|) are given by Eq. (JT]). The densities p(f t \h t , e t ) in (|9b|) are 

2 degenerate and are given by (|2ap . The densities p(/iij|u;j) in ([9c]) are given by (|2dp . and the 

3 densities p(h t i\h t -i t i, u>j) in ([9c|) are given by (f2c|) . The densities p(e(|P() in (|9cl|) are given by 

4 (f3a|) and Q. The densities p(P t \Pt-i,A,d,k) in (f9e|) are given by ([5]). The priors p(B\fl), 

5 p(Q), p(-A), p(d), and p(&) are discussed in the previous section. 

e 2.3.2 Conditional Distributions 

7 We sample from the following conditional distributions. For <t|, we sample from the inverse 

8 gamma distribution: 

p(er?|rest) oc p(a|) • |crj, P, F) 



. o. "0+T , 1 



t=l ^ i=i ' 



(10) 



9 where is the jth element of yt, fu is the ith element of ft, and bji denotes the ijth element 

10 of B. It follows from (|10p that the conditional density of <r| is an inverse gamma with the shape 



8 



"ojsoj + Et=i Vtj ~ ELi bjifti 



parameter Z/0 ^" T and the scale parameter ^ 

The posterior density of B is a matrix variate normal density given by: 

p(B|rest) cxp(B\fl) -p(Y\B,ft,F) 
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3 where £ R = (F'f 1 + I) and yx R = y'FS B . 



We now follow 



Kim et al. 



(|1998l ) and discuss how to sample the SV parameters u> and H. 



First we transform the SV equation (|2b|) into a linear model by taking the logarithm: 



fti — hti + z ti i 



4 where / t * = log(/ 4 2 j + c) and jfo is a log random variable. The scalar c is an "offset" constant 



5 that is set to be 10 5 . Following 



Kim et al. 



()1998l ). the distribution of / t * can be approximated 



by a seven-component normal mixture with the comp onent indicator v ariables s = {su}- Using 



7 the offset mixture integration sampler developed by iKim et al 



(|1998l ). for each i = 1, ..,q we 



8 sample (</>j,o"^j) jointly in one block marginalized over /ij and H and then in another block 

9 sample (fii,H) conditional on the rest in the mode l. To save c ompu tational cost, we do not 



impose the additional reweighting step introduced in 



Kim et al 



(1998) 



—1/2 

n Given that H is drawn, we can then obtain e% = V t / t to estimate the correlations 
12 and the correlation-level parameters. Now, since the factors are observed a nd we have et, the 



13 estima tion procedure for Pf, A, d, and k is exactly the same as that given in 



Asai and McAleer 



u ((2009) . To sample fro m the complicated and n on-conjugate univariate posterior distributions 

15 of d and k, following 

16 sampling (ARMS) of 



Gilks et al 



Asai and McAleerl (|2009l ). we adopt the adaptive rejection Metropolis 



1995). The complete MCMC procedure is given as follows: 



17 Step 0: Initialize B,Cl,s,u,H,k,d, and A. 

w Step 1: Sample £?|rest, then sample cr||rest for j = 1, ...,p. 



1 Step 2: Sample <fi, crz\F* , s and /x, H\F* , s, (f>, cr^ using the sampler of iKim et ajj (|1998l ). 

— 1/2 

2 Step 3: Obtain the standardized factors et = V t ft from the sample. 

3 Step 4: Sample P t from Purest, and then obtain S ej< = (diagPi)~a P t (diagP t )~2 for t = 1, T 

4 Step 5: Sample A|rest. 

5 Step 6: Sample (i|rest using ARMS, 
e Step 7: Sample A;|rest using ARMS. 
7 Step 8: Go to step 1. 

s Looping steps 1 to 8 is a complete sweep of the MCMC sampler. It is wort h noting that Steps 



Asai and McAleer 



9 2 to 4 link the SV processes to the factor correlations. To deal with this part, 

n 

10 (|2P09) adopt a two-stage procedure. In the first stage they estimate the SV parameters u and 



ii the log-volatilities H in one MCMC procedure and obtain the standardized series eu = Uuft, 



12 with U u = jj YaLi ex P 



2 n ti 



where denotes the Zth draw of the M MCMC iterations. 

13 Then, in the second stage, based on the series e t = (en, et q ), they estimate {Pi} and the 

14 correlation parameters (A, d, k) using another MCMC procedure. Clearly, the strategy does not 

15 conduct the MCMC estimation in a joint sense, which is arguably undesirable and improper in at 

16 least two respects. First, the method obtains the estimates the log-volatilities first and then plug 

17 in the estimates to run another separate MCMC. This manner averages out different samples 
is of volatilities. Therefore, in the inference of the correlation-level parameters, we actually work 

19 with only one fixed set of log- volatilities and residuals e. Secondly, for the purpose of prediction, 

20 we need to sample hf}^ and then obtain for I = 1, M. The plug- in me thod cannot be 



Asai and McAleerl ((2009), our algorithm 



21 applied in this case. Unlike the two-stage scheme of 

22 makes draws for u and H and then directly obtains the standardized series for the correlation 

23 parameters in each single iteration. In this way, we conduct estimation jointly with a full MCMC 



24 procedure, and the prediction can be performed directly using the usual MCMC methods. 
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3 Simulation Study 

3.1 Simulated Data Example 

In this subsection, we use simulated data to illustrate how O-DCFMSV works. Note that the 
illustration is based on a single run since at the very beginning we wish to present the result 
visually. A complete simulation study based on multiple replications is provided later. We set 
p = 10 observed series and q = 2 factors with a sample size T = 1, 000. The true data generating 
process (DGP) is described by: 

(i) Measurement equation: 



B 



\ 



I 1.00 0.30 -0.05 0.99 0.99 -0.10 0.00 0.56 0.00 0.00 
0.00 1.00 0.34 0.00 0.00 0.95 0.95 0.00 0.00 0.30 

ft = diag(0.05, 0.1, 0.13, 0.24, 0.35, 0.35, 0.24, 0.13, 0.1, 0.05). 

(ii) SV structures: 

hi,t+i = Mi + 0M(hit - Mi) + mt, Mi = -0-2, 
h 2)t +i = M2 + 0.98(/i 2t - M2) + mt, M2 = -0.5, 







f 







0.1 2 


\ 













0.27 2 


/ 



(iii) Factor correlation level: 

1 0.05 



1 I 1 



0.05 1 



V 



.003 -0.050 
-0.050 1.003 



,k = 25, d = 0.8, 



From (iii) and the initial value Po = Iq we can simulate a sequence of covariance matrices {Pt}, 
from which we can obtain the correlation matrices S £j t using Eq. Q. Given {S^t} together with 
(ii), we can generate two hidden systematic factors with time- varying correlation pt = [S ej t]2,i- 
Then, given the factors we can generate ten observed series Y with the setting (i). 

The MCMC study is conducted with 20,000 iterations, where the first L = 10, 000 draws are 
taken as burn-ins and the remaining M = 10, 000 are preserved. The program for the estimation 
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1 is coded in OX by iDoornikl (|2007l ). Table [I] summarizes the estimation results. We output the 

2 posterior means and the 95% intervals based on the M draws. The posterior mean is calculated 

3 by averaging the MCMC draws. The 95% credible interval is constructed using the (2.5%, 97.5%) 

4 percentiles of the simulated draws. We can see that, out of the pq + p + 3q + q{q + l)/2 + 2 = 41 

5 parameters, there is only one, 672, not covered by the 95% credible interval. 

6 In O-DCFMSV, one of the primary interests is in capturing the time-varying factor corre- 

7 lation. The factor correlation provides very useful information since it can reflect the market 

8 condition as we will see later. The top panel of Figure [T] displays the correlation fits. The 



9 smoothed estimate at t is calculated by the posterior mean pt = M 1 XytiPe.ilai; where S); 1 



i(9 



i(0 



is the Ith draw of the preserved MCMC iterations based on smoothing. Note that we draw XI 



(0 

c.t 



11 using Step 4 in the algorithm detailed in Section 12.3.21 The grey line is the true correlation 

12 and the black line represents the fits. We can observe that, though the fitted result appears 

13 smoother than the true values, the model in general well captures the pattern of the dynamic 



14 correlations, both t 



le m ovements and the average level. To measure the performance, following 



15 lAsai and McAleerl (|2009i ). we calculate two performance measures, both of which are based upon 

16 the mean absolute error (MAE) of the smoothing estimates. The first is MAE p = ^ Ylt \Pt ~ Pt\i 

17 which measures the quality of the correlation estimates. We obtain the MAEp = 0.208. This 
is suggests a satisfactory result, as we can see that the correlation varies in the (wide) range 

19 (-0.6,0.9). 

20 The second measure is used for evaluating the VaR estimates. This measure is meaningful 

21 to the O-DCFMSV model since one important application of the asset-return factor model 

22 is to obtain the VaR estimates of the portfolios throug h factor structures. S upp ose that we 



23 have a vector of portfolio asset weights w. According to 



Barbieri et al 



(2009) and 



Chib et al 



24 ((2006) , under the assumptions of normality and a zero mean, the 5% VaR of the simulated 

25 portfolio at time t is estimated by 1.645 • ap t , where ap t denotes the posterior mean of the 

, 1 

1/2 1/2 2" 

26 portfolio standard deviation op t = w'(BV t ~S e tV t B' + tt)w . (In fact this is the 95% 



27 quantile of the predictive density when the mean is zero, but by symmetry it is the negative 



12 



of the 5% quantile). Suppose that our asset holdings are equally weighted, which means that 
w = jqI. The true and estimated VaR are shown in the bottom panel of Figure [TJ where the 
grey line represents the true VaR and the black line is the estimate. It is readily seen that 
both the movement and the magnitude are nicely captured. The MAE measure is obtained by 
MAE VaR = i l VaR t St - VaR t I, where VaR t es * = 1.645 • & Pt and VaR° = 1.645 • a Pt . We have 
the result MAEy aR = 0.105, which is quite satisfactory, for the true VaR is fluctuating in a 
range from 0.5 to 1.5. 
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Table 1: MCMC results for the simulated data. TRUE is the true value of the parameter. 
MEAN is the posterior mean. (LOWER, UPPER) are the boundaries of the 95% interval. 





TRUE 


MEAN 


LOWER 


UPPER 




1.000 


0.989 


0.975 


1.003 


B21 


0.300 


0.303 


0.286 


0.321 


B31 


-0.050 


-0.044 


-0.064 


-0.024 


B41 


0.990 


0.969 


0.946 


0.993 


B51 


0.990 


1.009 


0.987 


1.031 


B6i 


-0.100 


-0.080 


-0.107 


-0.053 


B71 


0.000 


0.000 


-0.031 


0.030 


Bsi 


0.560 


0.569 


0.532 


0.605 


B91 


0.000 


-0.016 


-0.052 


0.021 


-Bio,i 


0.000 


0.007 


-0.037 


0.051 


B12 


0.000 


0.012 


-0.024 


0.050 


B22 


1.000 


0.995 


0.950 


1.041 


B32 


0.340 


0.328 


0.296 


0.359 


B42 


0.000 


-0.008 


-0.047 


0.031 


B52 


0.000 


-0.020 


-0.043 


0.003 


B%2 


0.950 


0.937 


0.908 


0.965 


B72 


0.950 


0.918 


0.899 


0.938 


B^2 


0.000 


-0.006 


-0.030 


0.019 


B92 


0.000 


-0.008 


-0.022 


0.006 


B\0,2 


0.300 


0.298 


0.281 


0.315 




0.050 


0.052 


0.048 


0.057 


°l 


0.100 


0.099 


0.091 


0.108 




0.130 


0.124 


0.113 


0.135 


°l 


0.240 


0.230 


0.211 


0.251 




0.350 


0.333 


0.304 


0.362 




0.350 


0.345 


0.316 


0.377 




0.240 


0.253 


0.232 


0.276 


4 


0.130 


0.135 


0.124 


0.147 




0.100 


0.101 


0.092 


0.110 




0.050 


0.049 


0.045 


0.054 


Mi 


-0.200 


-0.092 


-0.280 


0.087 


M2 


-0.500 


-0.859 


-1.560 


-0.141 


4>i 


0.950 


0.945 


0.878 


0.981 


4>2 


0.980 


0.977 


0.960 


0.991 


(Trt,l 


0.100 


0.125 


0.076 


0.211 


&r),2 


0.270 


0.222 


0.168 
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Figure 1: Factor correlations and VaR estimates. The top panel shows the true correlation 
process pt (grey line) and its posterior mean pt (solid black line). The bottom panel shows VaR^ 1 
(grey line) and its posterior mean VaRf st (solid black line) . 

3.2 Model Comparison 

To complete the illustration, we finish the simulation study by comparing O-DCFMSV with the 
benchmark, the model of Philipov and Glickman (2006b, hereafter PG), as it is also a Wishart 
FMSV model with dynamic factor correlations. The model specification is: 

y t \B,f t ,Sl~N p (Bf t ,n), 
/ t |P t ~N,(0,P t ), 

Pf^p^St-i ~ w g (p-\\k,s t ^), 



where the matrix Pt is a factor covariance matrix, the meaning of the matrix A and the scalar 

parameters d and k are the same as those in O-DCFMSV. Here we define the scale matrix as 

_ i —i 

St = jP+ 2 AP+ 2 , whi c h is the form of ©. It should be noted that, as mentioned in Section [27TT 



Philipov and Glickman! (|2006bl ) use the BEKK-type specification (jSJ) for St, however, in order 



to remove the effect caused by different parameterizations, we adopt the setting ([5|) instead 
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Asai and McAleerl ([2009j) point out that it is possible to use 



1 of ([6]) for the competing model. 

_d 

2 either one or the other as an alternative. The matrix power P t 2 is calculated by the spectral 

3 decomposition. 

We take (i), (ii), and (hi) used in last section as the true data generation process (DGP) 
for O-DCFMSV. For the true DGP of PG's model, we drop (ii) and use only (i) and (hi), since 
the model does not assume SV structures on the factors. Two datasets with different DGPs are 
generated and fitted with both models. To evaluate the performance, we calculate the Kullback- 
Leibler (KL) divergence as a measure of how far away the distribution given the estimated 
covariance is from that given the truth. Let and Slf be the true and estimated covariance 
matrices, respectively. Let p® = p{yt\^l) denote the density of yt given true covariance matrix 
of yt, also, let p| st = p(yt\'St) be the density of yt with the estimated covariance matrix plugged 
in instead of the true covariance matrix. Under normality, the KL divergence between p® and 



p| st is 



KL $\\p?) = J PKV) log 

= -| + itr (t^E?) - \ log |E°| + \ log \%\. 

4 In each replication we record the mean KL divergence (MKL), which is defined by y Ylt KL (pt\ |p| st ) 

5 as a summary of the KL divergence over every t. 

6 Since we have two true DGPs and two models, there are four combinations, which, by 

7 the DGP-model order, are referred as O-O, O-PG, PG-O, and PG-PG. In each combination 

8 we conduct the simulation with 40 replications and record their MKL. In each replication we 

9 calculate the differenced measure by subtracting the value of the true model from that of the 

10 wrong model, i.e. [O-PG minus O-O] and [PG-0 minus PG-PG]. The differenced measure is 

11 denoted by AMKL. We report the sample mean of AMKL from the 40 replications and the 

12 standard error as the final summarized output. Table [2] summarizes the comparison results. We 

13 can see that, for both DGPs, the differenced values of [wrong minus true] are both significantly 

14 positive, which indicate that the true models win. However, we can find that the mean AMKL 
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Table 2: Results of AMKL for the comparison of O-DCFMSV with PCs model using simulated 
data. The difference is defined by the false model minus the true model. Std Err is the standard 
error of the mean. 



True DGP 


Mean AMKL 


Std Err 


O-DCFMSV 


0.012 


0.002 


PG 


0.007 


0.002 



appears to be larger in the case when the true DGP is O-DCFMSV. This has an important 
implication that, given both the models are misspecified, PCs model is more distant from the 
truth than O-DCFMSV; in other words, the KL loss is greater when PG is used and O-DCFMSV 
is correct, than vice versa. 

4 Empirical Study 
4.1 Factor Correlation 

This section uses two empirical examples to illustrate the applications of the O-DCFMSV model. 
We use three monthly Fama-French (F-F) factors obtained from Dr. Kenneth French's data 
library. The factors are the market excess return (Mkt), the Small-Minus-Big factor (SMB) 
and the High-Minus-Low factor (HML). All the factors are rescaled to (-1,1) by multiplying by 
0.01. The observation period is from July 1963 to December 2005 with a total sample size of 
510. Figure [2] shows the time series plot of the three rescaled F-F factors. We can see that, 
the behavior of the volatility is quite different among different factors. For example, during the 
1980's and the 90's, the volatility is small for SMB, whereas it appears to be large for Mkt and 
HML. In addition, we can observe that a cluster occurs around late 1990's to early 2000's in all 
three factors, but the magnitude of the volatility is noticeably larger in SMB and HML than in 
Mkt. Accordingly, we need to allow the factor volatilities to have separate dynamics in order to 
reflect such facts. 

Prior to the examples, we first display the time series plot of the "true" factor correlations to 
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1970 1960 1990 2000 



Figure 2: Time-series plot of the rescaled F-F factors, Jul 1963 - Dec 2005. 

1 show that the dynamic factor correlation is a reasonable setting. The "true" correlation at time 

2 t is pt-r ■. t+r ; which we take simply as the empirical correlation calculated from the data within 

3 the window [t — r, t + r]. For example, if we choose r = 3, then the correlation of (Mkt,SMB) 

4 at January 1980 is calculated by the empirical correlation of (Mkt,SMB) from October 1979 to 

5 April 1980, which approximately represents a half-year correlation. Here we choose r = 6 for 

6 calculating the 1-year correlation, r = 12 for the 2-year, and r = 18, the 3-year. By rolling the 

7 windows, we obtain the "true" correlations over time. 

8 Figure [3] shows the 1-year, 2-year, and 3-year pairwise correlations of the three factors. It 

9 is obvious that some pairs have quite large correlations during certain periods. The shaded 

10 areas account for the events having great economic impacts, which respectively are the first 

11 and second oil crisises, and the bursting of Dot-com bubble. Obviously, we can see that the 

12 factor correlations are changing over time. In particular, we can observe a common pattern: 

13 the correlations climb to a higher level or local peaks during these turbulent periods, while in 

14 the "calm" periods such as 1980's to 90's, the correlations decline to a relatively low level. This 

15 represents the well-known "correlation breakdown" phenomenon that has long been recognized 
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in empirical data, referring to the patter n that the correlation during ordinary and stressful 
market conditions differ substantially. See iRevI (|2000l ) for a detailed discussion. The correlation 



breakdown implied in Figure suggests that time- varying factor correlation should be considered 
in the modeling. We would also like to point out that the main advantage of using O-DCFMSV 
over the empirical rolling-window method to estimate factor correlations is that we can calculate 
the credible intervals, by which we can obtain the significance relative to a specific critical level. 




~i 1 1 r 

1970 1980 1990 2000 




~ i 1 1 r 

1970 1980 1990 2000 




i 1 1 r 

1970 1980 1990 2000 



1 1 

\\\\\\\\\\v 


- W\ 




1 

1970 1 


i i i 

980 1990 2000 


~s 1 


I I 


_ Wl 




i i i i 

1970 1980 1990 2000 


A 1 


i i 





i 1 r 

1970 1980 1990 2000 




i 1 1 r 

1970 1980 1990 2000 



- i 


L l 




1 



1970 1980 1990 2000 




— I 1 1 1 — 

1970 1980 1990 2000 



Figure 3: "True" correlations of the F-F factors Mkt-RF, SMB, and HML, Jul 1963 - Dec 2005. 
The shaded areas account for the events that have great economic impacts, which respectively 
are the first and the second oil crisis, and the Dot-com bubble burst. 



4.2 Example 1 — Portfolio Return Data 

In this first example we use the three F-F factors demonstrated in last section as the covariates. 
The return series Y are the monthly average value weighted returns for 10 industry portfolios 
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obtained from Dr. Kenneth French's data library. The 10 portfolios are: NoDur, Durbl, Manuf, 
Enrgy, HiTec, Telcm, Shops, Hlth, Utils, and Other. A detailed description for these portfolios 
can be found in the data library. Again, we first convert the data to a (-1,1) scale by multiplying 
by 0.01. Figure H] shows the time series plot of the data. We can observe some common clusters 
occurring in the mid 1970's and the early 2000's, which suggests that the factor SV structure 
can be useful. Figure \5\ shows the fitted factor correlations. The black lines are the fitted values 
and the grey lines represent the 2-year "true" factor correlations presented in Section 14.11 We 
see that the estimates are smoother than the "true" values, but in general, the movements and 
the magnitude of the correlations are properly captured. 




1970 1980 1990 2000 1970 1980 1990 2000 



Figure 4: Time-series plot of the ten portfolios, Jul 1963 - Dec 2005. 
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Figure 5: Estimated and "true" correlations of the factors for the portfolio data. The black lines 
are the fitted values and the grey lines are the "true" values. 

We now compare the O-DCFMSV model with PCs model using several performance mea- 
sures. The first two measures are based on the one-step-ahead predictive ability for the return 
covariance matrix. Notice that in this example we have 10 return series and in each month there 
are more than 10 transaction days. Thus, we can use daily returns to construct a nonsingular 
empirical covariance matrix as a proxy for the "true" covariance. Given the "true" covariance 
matrices, we can therefore compute MAEy a j^ for the equally-weighted portfolio as we do in 
Section [3) The empirical covariance matrix at month t, denoted by St, is simply the sample 
covariance matrix constructed from the daily observations within that month with an adjust- 
ment factor of nt/(nt — 1), where n% is the number of transaction days within month t. The 
one-step-ahead predictor for the return covariance matrix given a model can be obtained by the 
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conditional covariance: 



= Var {y t+ i\F t ; M) = BVar {f t+ i\F t - M) B' + ft, 

where T% = {yi,---,yt} is the set of observations collected up to time t, and M. denotes the 
model, either PG or O-DCFMSV. In the implementation, for each period t + 1 we rerun the 
MCMC procedure to obtain the one-step-ahead covariance matrix. Let be the one-step- 

ahead predictive covariance matrix of yt+i, which is estimated by 

i M 

^4EK* (1) '+ fi(i) * 
1=1 

i where M is the number of preserved MCMC iterations and P^x ~ -Pi+i| r est; A4pc- 
For the O-DCFMSV model, the factor follows: 

f t+ x\V t+ x,-E e ,t+l ~ N q (0,V t ^V ejt+1 V^). 

1/2 1/2 

Define R t +x = V t ^ £ e ,m^+i , where V t+ x = diag(14+i i i, V t +x,q)- From ((2c]), we have that 

V t+ x,i\hti,^i = exp(ht+x,i)\hu, Ui ~ logN(V^ + i ) t|A ti , cr^), where (12a) 

hi = 4>ihi + (l - 4>i)m, 
Vl^hi,^ ~ logNC^I^IAtt/2,4,/4), (12b) 

and logN(x|a, 6) is a log normal density in x where logx ~ N(logx|a, b). The correlation matrix 

S ei t+i is obtained by @. Then, in the Ith. MCMC iteration we can calculate R^lx- Let 

be the one step ahead predictive variance of yt+x under the O-DCFMSV model. Given Hj+i, 
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we have the approximation 



i M 

=&i«^E( B(0 ^i Wi,(0 ' + n(0 ) 
1=1 



To evaluate the predictive accuracy for the 5% VaR predictions for the equally-weighted 
portfolio, we calculate 

MAEvaR = i I VaR tti - VaR t ° +1 , 



where N is the number of forecast periods and the quantity VaR^ +1 = 1.645- (w'^t+iw) 1 ^ 2 with 
w = p^ 1 ! being the weight vector. We calculate the estimate VaR^ using 



VaR|f x = 1.645 x M 



M 

i=i 



w 



(I) 



In addition to the MAEyaR measure, some authors suggest calculating the differ ence between 
the "t rue" and predicted covariance matrices in an elementwise sense. Following 



Ledoitetal 



(|2003l ). we calculate the root-mean-square error (RMSE) based on the Frobenius norm (FN): 



FN 



~iM I 
J t+l\ 



N ^ 



t+l\ij 



'■J 



1/2 



1 Because both MAEyaR and FN measure deviations from "true" values, a smaller value indicates 

2 a better model. Here we calculate the ratio of PG to O-DCFMSV for these measures so that 

3 we can compare the deviations. We output the mean values of the ratios over all the prediction 

4 periods as the summarized results. 

Besides using the empirical covariance error-based measures, we also evaluat e model perfor- 



mance in terms of the predictive quality for the return series. To do this, following 



Geweke and Amisano 



(|2010l ). we first obtain the one-step-ahead log predictive score (LPS) and then calculate the cu- 



mulative log predictive Bayes factor. The one-step-ahead LPS evaluated at yt+i under the 
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specific model M is given by 



L-pS{y t+1 \F t ;M) = ^gp{y t+1 \F t ;M), 



where the predictive density p(yt+i\J r t] M) is calculated by 



p{yt+i\FuM) = j p{yt+i\^t,o M )p(e M \F t )de M 
1 M 

1=1 

M 

= FE N ,(^m|^)/S 1 ,^ ) ;-M), 



1=1 

where 9m is the set of parameters for the model M. and x t +\ is the latent state vector. Then 
we can calculate the cumulative log predictive Bayes factor of Model 1 against Model 0, which 
is defined by 

log( J Bi ! o) = ^2\ogp(y t \T t -i;Mi) - ^ logp(y t |.F t _i; M ) 



[hPS(y t \Tt-i;Mi)-LPS(y t \T t -i;Mo) 



In addition, we calculate the LPS for the equally- weighted portfolio w'yt+i, say, LPS-EW: 



LPS-EW (w'y t+1 \T t ,M) = log P(w'y t+1 \F t ,M) 



log 



1 M 

i=i 



w 



The reason to calculate LPS-EW is that if the model performs better in this measure, then we 
have evidence to believe that the model should also be better in forecasting the VaR for an 
equally-weighted portfolio. In this sense, we can regard LPS-EW as an alternative to MAEy a R- 
Similar to LPS, we then calculate the cumulative log predictive Bayes factor of Model 1 against 
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Model for the equally-weighted portfolio: 



log(Bf^) = ^ [LPS-EW(y t |^_i;Mi) - LPS-EW(j/ t |Ji_i;M ) 



The cumulative log pre dictive Bayes factor has a si mple criterion for checking statistical sig- 
nificance. According to 



scoring rule descr i bed in 



Kass and Rafterv 



Geweke and Amisano (2010), the evaluation is conducted via the log 



Gneiting and Raftervi (|20071 ). The detailed log scoring rule is given in 



(|1995l ). of which we use the following criterion: if log(.Bi o) < 0, the evidence 
is in favor of Model 0; if log(-Bi ) o) £ [0, 1), the evidence is not worth more than a bare mention; 
if log(Bi j o) € [1)3), the evidence is positive in favor of Model 1; if log(i?i ) o) € [3,5), the evidence 
is strongly in favor of Model 1; if log(i?i i o) > 5, we have very strong evidence in favor of Model 
1. 

We use a three-year out-of-sample prediction period, from January 2006 to December 2008, 
with a total length N = 36. This time frame covers two market conditions: before 2007, 
when the market is relatively calm, and afterwards when the market is relatively volatile due 
to the subprime crisis. Therefore, we can compare model performance across different market 
conditions. Here, the one-step-ahead prediction is conducted on a rolling basis, i.e., if we use 
observations j/i, ...,j/t to forecast j/t+i> then in next period, JJt+i is included as a sample 
for the prediction of Ut+2- Table [3] (a) summarizes the results of the comparison using the 
empirical covariance error-based measures. R-MAEyaR and R-FN denote the ratio of PG to 
O-DCFMSV for MAE Va R and FN, respectively. We see that the mean ratio for MAE Va R is 
1.22, suggesting that the deviance of PG is 22% larger than that of O-DCFMSV, which is a 
considerable difference. The mean ratio for FN is 1.19, which also suggests a considerably large 
difference. Table [3] (b) is the summarized comparison results using cumulative log predictive 
Bayes factors. \og{Bo,pc) and \og{BQ V p G ) respectively denote the cumulative log predictive 
Bayes factor of O-DCFMSV against PG for y t+ i and w'y t +i. We see that log(So,PG) = 14.94 > 



5, suggesting very strong evidence in favor of O-DCFMSV. Similarly, for the equally-weighted 
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Table 3: Results for the comparison of O-DCFMSV with PCs model using F-F portfolio data. 
R-MAEy aR and R-FN are the ratios of PG to O-DCFMSV in MAEy aR and FN, respectively. 

(a) Empirical error-based measures. (b) cumulative log predictive Bayes factor. 



Measure 


Mean 


Measure 


Value 


R-MAEy aR 
R-FN 


1.218 
1.191 


log(-B 0) PG) 


14.940 
13.176 



portfolio, we have log(-BQp G ) = 13.18 > 5, which again strongly supports O-DCFMSV. In 
conclusion, all the evidence strongly suggests that O-DCFMSV outperforms PCs model. 

The results in Table [3] (a) and (b) are aggregated over the prediction period and do not 
show how the two models perform at each time point. To be more convincing, we examine the 
"periodwise" performance. Figure [6] shows the period-by-period results for MAEyaR and FN. It 
is readily seen that the values of O-DCFMSV are constantly smaller than those of PCs. Figure[7] 
displays the period-by-period plot of the difference in LPS and LPS-EW (O-DCFMSV minus 
PG). From Figure [7] we observe that the differences in LPS and LPS-EW of O-DCFMSV minus 
PG are constantly greater than 0, which shows that the LPS and LPS-EW of O-DCFMSV are 
constantly larger than those of PCs model. The only noticeable drop occurs at October 2008, 
which is the month right after the bankruptcy filing of Lehman Brothers and the bailout of Fannie 
Mae and Freddie Mac. The market was extremely volatile at that time. In fact, according to 
the result not shown here, the values of the predictive density functions for both models are only 
about exp(— 87) to exp(— 86), which suggests that in such an extreme environment, both models 
appear to be equally unlikely (i.e., they do not hold). Consequently, the result at this time point 
is arguably unrepresentative. Overall, based on these results, we conclude that O-DCFMSV 
generally performs better than the PG model in terms of one-step-ahead prediction. 

4.3 Example 2 — Stock Return Data 

The second example fits the monthly stock return data. We collect 20 historical stock prices from 
Yahoo! Finance. The observation period is January 1977 - June 2007, with 366 observations. We 
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Figure 6: Period-by-period comparison using MAEyaR and FN. The portfolio return data. 




Jan-06 Mar-07 May-08 Jan-06 Mar-07 May-08 



Figure 7: Difference of the predictive log-likelihood for the returns and for the equally-weighted 
portfolio returns. The portfolio return data. 



calculate the stock returns by taking log Ptj — log Pt-i,j, t = 2, 366 and j = 1, 20, where 
Ptj is the price of the jth stock at time t. This generates a set of return data with a sample size 



T = 365. Similar to the example given in 



Philipov and Glickmanl (|2006bl ). in this illustration we 



model the stock returns using two pairs of factors, (Mkt, SMB) and (Mkt, HML), respectively. 
We also compare O-DCFMSV to PG based on the one-step-ahead prediction quality. The out- 
of-sample period is again January 2006 - December 2008. Notice that in this case we have 20 
stock returns, but during the sample period not every month has at least 20 transaction days; 
for this reason, unlike Example 1, here we do not calculate the empirical covariance error-based 
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Table 4: Results for the comparison of O-DCFMSV with PG using stock return data. 



Measure 


(Mkt,SMB) 


(Mkt,HML) 


log(B ,PG) 


7.200 


7.272 


log(^,P G ) 


6.408 


6.480 



measures. 



Table |4] shows the aggregate results, from which we can see that, no matter what pair of 
factors is used, the cumulative log predictive Bayes factors log (_Bq*p G ) and log(Bo,pc) are both 
greater than 5, suggesting very strong evidence in favor of the O-DCFMSV model. Figure [8] 
and Figure [9] respectively display the period-by-period differences in LPS and LPS-EW (O- 
DCFMSV minus PG) for the pairs (Mkt, SMB) and (Mkt, HML). As one can see, similar to 
what we observe in Example 1, the differenced values are uniformly greater than in both cases 
except the one at October 2008. The reason is the same. At this month, no matter what pair 
of factors is used, the values of the predictive density functions for both models are as low as 
exp(— 102) to exp(— 101). Clearly, both models fail to work under such a market condition. 
Thus, again, we argue that the differences obtained at this time point may not be meaningful. 
Regardless of this outlier, O-DCFMSV performs uniformly better over the out-of-sample period. 

Another natural question to ask is which of the two combinations of factors provides a better 



explanation to the data. This is a model selection question which in its generality a s 



Philipov and Glickmanl (|2006bl ). Our 



cs how many 



and which factors should be used and is not discussed in 
solution is straightforward. We can simply compare the predictive performance of the candidate 
models using the cumulative log predictive Bayes factor. For instance, in this illustration we 
have two O-DCFMSV models, one with the factors (Mkt, SMB), denoted by MS, and the other 
with (Mkt, HML), MH. Table [5] summarizes the comparison result of the two models in terms of 
log(BMS,MH) and \og{BfJg MH ). We see that, both of the cumulative log predictive Bayes scores 
show positive (but not strong) evidence in favor of (Mkt, SMB); therefore, we may conclude that, 
for a two-factor O-DCFMSV model, (Mkt, SMB) is a better choice for the data. Furthermore, 
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Figure 8: Difference of the predictive log-likelihood for the returns and for the equally-weighted 
portfolio returns. (Mkt, SMB). The stock return data. 




Figure 9: Difference of the predictive log-likelihood for the returns and for the equally-weighted 
portfolio returns. (Mkt, HML). The stock return data. 



1 if we have models that contain different numbers of factors, we can also use this approach to 

2 select the "best" model or the optimal number of factors. 



5 SV on the Errors 



4 It is commonly seen in financial studies that daily and weekly data exhibit more volatility than 



5 monthly data. For this reason, we may co nsider allowing each of th e idi osyncratic e r rors t o 

~2 



6 follow an independent SV process, see, e.g. 



Pitt and Shephardl (|1999h and 



Chib et al 



(2006). 
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Table 5: The model comparison result using stock return data. The models are O-DCFMSV 
given (Mkt,SMB) and (Mkt,HML) as the covariates. 



Measure Value 
\og{B MS ,MH) 1^269 
l°g«*W) 1-187 



We modify the model form ([TJ as 



y t = Bf t + e t , e t = Al /2 u t , 



(13) 



where ft and e t are independent and u t ~ N p (0, I). The scaling matrix A t = diag [e htl , e htp ). 
where {htj,j = 1, ■•■,!>} is the log-volatility of the error terms following the SV process: 



h tj = [ij + 4>j{h tj - fxj) + rftj, Vtj ~ N(0, o^ j). 



Note that, in previous sections we use the index i = l,...,q for the SV processes of the factor 
volatilities. Here, for the log-volatilities of the errors, we use the index j = 1, p. With the 



spec ification (1131) we need to chan ge the sampling scheme for B. Following 



Geweke and Zhou 



(119961 ) and 



Lopes and Westl (|2004l ). we set the priors bj ~ N g (0, c^-I q )^ where bj is the jth row 



of B. Throughout we choose CQj = 5, which defines an uninformative prior for each bj. Let yj 
be the jth. column of Y and hj = {hij, ...,htj}. Given the likelihood function: 



C (bj | yj ,hj,F) oc. exp 



1 



yj-FbjyAfiyj-Fbj) 



where Aj = diag (e hlj , e htj ). The conditional posterior for bj is given by: 



1 



P(bj\rest) oc exp <J - ^ [(bj - Hbj)^ (bj - A*6 f ) 
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where T, b] = (c 2 / + F'A- 1 F) 1 and n h . = H hj F'A j Fo r {fij, <f>j, a n j} and h t j, in a same 



L 0.y 

2 manner, we use the integration sampler of 



Kim et al 



1998) to make draws. The sampling 



3 scheme for the other parameters remains unchanged. 

4 To know how much we gain from adding individual SV processes on the idiosyncratic error 

5 terms, using both monthly and daily data, we compare the O-DCFMSV with SV on the errors 

6 (SV-Err) to the original model, O-DCFMSV. The model performance is compared in terms of 

7 the cumulative log predictive Bayes factors. The first comparison is based on monthly data, 

8 where we use the same dataset as in the first example of Section 14.21 Similarly, the sample 

9 period is July 1963 - December 2005 and the out-of-sample period is three-years long covering 

10 January 2006 - December 2008, with a length N = 36. Table [6ja) summarizes the results of 

11 the comparison. We can see that for the monthly portfolio data, O-DCFMSV beats SV-Err 

12 in predicting yt+i but not in the equally- weighted portfolio w'yt+i- However, if we examine 

13 Figure PTOl the period-by-period prediction results, we readily see that, in general O-DCFMSV 

14 performs better than or as well as SV-Err does. Nonetheless, just as we observed in Section \A. 2\ 

15 the O-DCFMSV fails to capture the movement in returns in the single extreme period, October 

16 2008. This failure overrides those superior performances at other periods and does not reflect 

17 the overall performance of the two models. Except for this extreme case, we should agree that 
is O-DCFMSV suffices to model the monthly data. 

19 In the second comparison, both models are fitted to daily data. The dataset contains 30 

20 daily stock prices collected from Yahoo Finance. The sample period is from January 3, 2006 

21 to July 31, 2008. We calculate the stock returns as described in Example 2. This gives us the 

22 return data of a sample size T = 649. The out-of-sample period covers one full month, August 

23 1 - August 29, 2008, with a total length N = 21. Table [6^b) shows the summarized result. 

24 Obviously, we see that the evidence is in favor of SV-Err since both cumulative log predictive 

25 Bayes factors are smaller than 0. The result suggests that, for daily data the SV-Err model 

26 should be considered. 
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Table 6: Comparison results for SV-Err (S) vs. O-DCFMSV (O). 
(a) Monthly portfolio data. (b) Daily stock return data. 
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Figure 10: Comparison of SV-Err and Non-SV-Err using monthly portfolio data. 

6 Conclusion and Discussion 

In this paper we propose a dynamic-correlation FMSV model where the factors are observable. 
The novelty is that we simultaneously allow the factors to have separate SV processes and the 
factor covariance process follow an inverse Wishart process, which provides great flexibility to 
describe the dynamics of the factors. We also develop an algorithm based on a full MCMC 
procedure to estimate the model. A significant advantage of the algorithm is it makes feasible 
to carry out prediction and model selection. This is an important improvement in this context, 
as it enlarges the scope of applications of the WIC-type models. From the comparisons using 
simulated data and various empirical data, we show that O-DCFMSV outperforms the competing 
model, PCs FMSV model. 

Moreover, we also consider the SV-Err setting that is adopted by many authors. Our em- 
pirical result shows that, for monthly data, the basic O-DCFMSV suffices; for daily data which 
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contain more volatility, we may need to consider the SV process on the error term. The model 



is easy to be extended to a 
specification suggested by 



low for heayy-tai ledness. For example, we can add the adhoc scaled-i 



Kimetal 



(|1998l ) to the idiosyncratic errors. A possible future work 
is to allow the factors to be latent so that the model can be more flexible. When it comes to 
the latent factor structure, many issues need to resolve, such as the nonidentification problem, 



the choice of the number of factors, and so on. We expect further research on this direction. 
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